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THE MYTH OF PERFORMANCE-BASED TEACHER EDUCATION 



In recent years research and development in teaching has been 
strongly influenced by the idea of ''performance-based' teacher education* 
That is , ■ the identification of teacher skills that are related to 
increased student achievement and the corresponding development of 
teacher training programs and materials (see Gage, 197?)* Since 
hundreds of millions of dollars annually and substantial human resources 
are involved, the scientific basis for this movement is of considerable 
importance. In our opinion, an analysis of the research on the relation 
between teacher performance and student achievement fails to reveal 
an empirical basis for performance-based teacher education* 

The contribution by Rosenshine uud Furst, "Research on Teacher 
Performance Criteria" in the book Research in Teacher Education: 
A Symposium , 1971 (edited by B#0. Smith), reviews a carefully selected 
set of studies. " Rosenshine 's several writings on this and related 
topics establish him as knowledgeable of this ho6y of literature. 

Rosenshine. and Furst cite some 50 studies and propose 11 teacher . 
behavior variables which they regard as "the most promising of the 
variables studied*" The variables proposed are (l) clarity , 
(?) variability , (3) enthusiasm , (^) task-oriented and/or business - 
like behaviors , (5) student opportunity to learn criterion material , 
(6) use of student ideas and p;eneral indirectness , (7) criticism , 
(8) use of structuring comments , (9) types of questions , (10) probing , 
aitd (11) level of difficulty of instruction , in that order. 

The studies cited, and the teaching variables proposed, were care-; 
fully selected to represent the most conclusive and the "best" research 



to be found. Rosenshine and Furst say: 

The variables described below were abstracted from more 
detailed reviews (Rosenshine, 1970 a,b) and are the most 
promising of the variables studied* The variables with 
the strongest support are presented first# 

A description of the eleven strongest variables 
contained in this research is presented below together 
with the results obtained when these variables were studied. 
The best results were obtained on the first five" vari- 
ables; the results were less conclusive on the last six 
variables. The major focus is upon process-product studies, 
but experimental classroom studies are cited whenever 
appropriate. 

Unlike the myriad competencies outlined by' the 
model program builders, the skills listed below have some 
support in studies of naturally occurring teacher behav- 
ior and student achievement. 

The studies included in the Rosenshine and Furst review range/from 

major long-term research efforts to minor studies of very short 

duration* * 

We have carefully analyzed 4? of the studies cited for seven 

characteristics: (1) criterion of student achieve..j3nt ; {?.) operational 

definitions of teaching behavior ; (3 ) statistical results; (4) legiti-' 

macy of statistical tests (5 ) sample characteristics; (6) study 

conditions; and (7) comparabiltiy of groups. ' 



Method 

The method for gathering the information contained in the 
operational-definition table (Table l) and our criteria for determining 
significance claims follow. 

In searching for the author ' s operational definition of the teacher 
behavior under scrutiny, we referred to the manner in which Rosenshine 
and Furst cite the study. Often, Rosenshine and Furst quote the 
author's operational definition of the variable. For example, 
under clarity : "The investigators used different descriptions 



TABLE 1. OPERATIONAL DEFINITIONS OF TEACHER-PERFORMANCE AND FACE VALIDITY DECISIONS 



VALIDITY CLARITY 

YES "Clarity o( Presentation" on the Smnfoid Teacher Competence Appraisal Guide (seven-point rating scale). {Belgard, Rosenshine, & Gage, 1968) 
NO "Difficulty of Lesson": "Evaluaies the lesson in terms of its easu or difficultv for the children." (Chall & Feldmann, 1966) 
NO "Disorganized-Systematic" un an adjective checklist. "Organiiation of Lesson" and "Lesson Presentatiort" on the Stanford Teacher Competence 

■ Appraisal Guide {seven-point rating scale). (Fortune. 1967) . 
YES "Clarity of Presentation" on the Stanford Teacher Competence Appraisal Guide (seven pomt rating scale). {Fortune, Gage, & Shutes. 1966) 
YES "Were the points he (she) made clear and easy to understand?" "Were his {her) presentations usually well-organized?" "Coherence": the 

teacher's "artiCMl.^teness, his ability to make points which the students understand." {Solomon, Bezdek, & Rosenberg, 1963) 
YES Q-sort on the "Intellectually Effective vs. Intellsciually lneffef:tive" continuum. "The intellectually effective teacher . .. was the one who 

appeared to be able to explain concepts clearly and such that the students s?emed to be gaining understanding." (Wallen, 1966) 

CLARITY: LOW-INFEREtiCE ^/*AnlABLE S ' 

YES ■ "Vagueness Categories," including "ambiguous designation," "approximation," "indeterminate quantification," and "probability." (HtMer, 
Fisher; & Kaess. 1969) 

NO "Interpretation," 4 category for analysis of taped lessons, {.'^olomon, Bezdek. & Rosenberg, 1963) 
NO "Percent of questions leading' ID pupil response." (V'/right & Nuthall, 1970) 



. CLARITY: ORGANIZATION 

YES . "A typical itcn on the Disorganization ?cale is, 'There are long periods during which the class does nothing.'" The study focused on "the 
socio-emotional climaiy of the class" and on "using student perceptions of it as predictors of class learning." (Anderson & Walberg, 1968) 

YES "Organization of the Lesson" on the Stanford Teacher Competence Appraisal Guide (seven-point rating scale), (Belgard, Rosenshine. 
& Gage. 1968) 

YES "Disorg? nized-Systeniatic" on an adjective checklist. "Organization of Lesson" and "Lesson Presentation" on the Stanford Teacher Competer : 

Appraisal Guide (seven-point rating scale). (Fortune, 1967} 
YES "Organization of the Lesson", an item or the ?.tanfnrd Teacher Appraisal Guide, was not studied. (Fortune, Gage, and Shutes, t968) 
YES "Organization": if the teacher trie? "'to relate ideas, theories, current e*jents, etc. in an ordered, meaningful, coherent, aiia logical fashion . 

(Solomon, Bezdek, & Rosenberg, 1963) 
YES "Disorganization", a four-point scale on the Learniiig Environment Invnniory . "a measure of the social environment of learning." (Walb-erg, 1969) 
YES "Disorganized" on :he Classroorri Climate Que stionnaire. (Walberg & Andenson. 1968) 



VARIABILITY 



YES Listed teacher aids (e g., charts, film, radio, etc.) and pupil materials (e„g., magazines, paint, workbook, etc.) (Anthony, 1967) • 
NO "Inflexible-Adaptable" on an adjective checklist. (Fortune, 1967) 

NO "Multiple Cognitive Levels": "Ratio of number of lines devoted to analytic and evaluative substantive-logical processes to number of Hnes 

devoted io (inpirical logical process." (Furst, 1967) 
YES Answers to "Did the instructor follow up new and unexpected topics, suggested by student questions or suggestions?" and "Did he (she) 

sometimes nake changes in the way material was being presented^" (Solonr^rn, Bezdek, and Rosenberg, 1963/ 
NO. ."Verbosity"* "Frequency of verbal output." (Thompson and Bowers, 1968) 

NO "This school has a great deal of resource materials for extra study" and "This school offers extra learning facilities which include a library. 

audiovisu.i'l aids, etc." (Torrance and Parent, 1966) 
NO A Bio graphical Inventory including "books in home." "non-school reading," "musician," "career discussion with adults," etc. (Walberg, 1969) 
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VARIABILITY: FLEXIBILITY 



YES .■ Comparison of ratio of Indirect Influenced) to Direct Influence (D) across activity periods. (Flanders, 1970) 

YES "Flexibility": "... beginning with largest cell frequencies, a count of cells necessary to account for 60 percent of the tallies." (Soar, 1966) 
YES Flexibility, "The number of times a teacher changes behavior in .i given time interval." (Vorroyer, 1965) 

ENTHUSIASM ■ 



NO "Dull-Stimulating" cn an adjective checklist. (Fortune, 1965) 

NO "Dull-Stimuiating" and "Apathetic-Alert" on an adjective checklist. (Fortune, 1967) 

NO "Apathetic-Alert," "Dull-Stimulating," and "Stereotyped-Or'ginal" on an adjective checklist. (Kleinman, 1964) 

YES Teachers were asked to lecture in either an enthusiastic or unenthusiastic manner. (Mastin, 1963) 

YES "Energy" and "Envhusiasm" on a teacher behavior rating scale. "Was the instructor enthusiastic and excited about the subject rr , 
student questionnaire. (Solomon, Bezdek, & Rosenberg, 1963) ; 

Q "Dull vs. Stimulating" on a teacher behavior rating scale. (Wallen, 1966) 



TABLE 1. <Com.) 
VALIDITY 



TASK-ORIENTED AND/DR BUSINESSLIKE BEHAVIORS 



NO "Task-oriented, teachur-centeied" (e.g., "The teacher tells us exactly how to do our homework") vs. "Task-oriented, pupil-centered" (e.g.. "The 

teacher has us help each oiher in class") on a pupil survey. (Beiderman, 1964) 
NO Rating scales for "approach to learning; infornnation" and "approach to learning; thinking" on an observation inventory. (Chall & Feldmann, 1966) 
NO "Evading-Responsible/' "Erratic-Steady." "Disorganized-Systematic," and "Excitable-Poised" od an adjective checklist. (Fortune, 1967) 
NO "Evading-Responsible." "Erratic-Steady," Excitable-Poised." and "Disorganized-Systematic" on an adjective checklist. (Kleinman, 1964) 
YES Student ratings on "My teacher has encouraged this class to think of original solutions to n'rathematical problems," "In this class we have one of 

the most hard-working classes in the school," and "My teacher has encouraged this Jass to work out and have our own answers to problems." 

(Torranpe & Parent. 1966) 

YES ■ "Achieveinent Oriented vs. Non-Achievement Oriented" on an observer Q-sort, (Wallen, 1966) • 
YES Student teachers' grades in student. teaching depended on their students' achievement scores. (Wittrock. 1962) 



STUDENT OPPORTUNITY TO LEARN CRITERIDN MATERIAL 



YES "Substantive," defined as "subject matter of Instruction." (Bellack, Kliebard, Hyman, & Smith, 1966) 
YES "Total number of relevant words" in lecturu. (Rosenshine, 1968) 

YES V\ihether or not the lesson answered particular test items ("content relevance"). (Shutes, 1969) 



USE OF STUDENT IDEAS AND GENERAL INDIRECTNESS: USE OF STUDENT tPEAS 



YES "Accepts or uses ideas of pupils" on an observation record. (Flenders, 1970) 

YES "Accepts or uses student's answer .or idea" on observation record. (Perkins. 1965) 

YES • "Accepts or uses ideas of student" on observation record. (Soar, ;966) 

NO "Repetition of response" in tape analysis. (Wright & Nuthall, 1970) 



USE OF STUDENT IDEAS AND GENERAL INDIRECTNESS: INDIRECTNESS 



YES "Accepts or uses ideas of pupils" on an observation record. (Flanders. 1970) 

NO "High" or "Low" in emotional climate on an observation record (Higf> = warmth and friendliness). (Medley & Mitzel, 1959) 



USE OF STUDENT IDEAS AND GENERAL INDIRECTNESS: l/D RATIO 



VbS Ratio of "Indirect- Influence" to "Direct Influence." (LaShier, 1967) 



USE OF STUDENT IDEAS AND GENERAL INDIRECTNESS: EXPE RIMENTAL STUDIES 

YES Teachers were trained to create a "positive social-emotional climate" using Flanders I. A. (Carline, 1969) 

YES The experimental group was trainal tr> be more indirect in their teaching. (Gunnison, 1968) 

YES "Teacher-Centered" vs. "Pupil-Centered" treatments. (Herman, Potterfteld, Dayton, & Amershek, 1969) 

YES Teachers were asked to be either "Directive" or "Responsive." (Miller, 1966) 

CRITICISM - . 



YES "Intensity of negative support observed." (Anthony, 1967) 

YES "... bawling someone out ..." in classroom observation record. (Cook. 1967) 

YES "Criticizing or justifying authority" on observation record. (Flanders, 19701 

YES "Negative motivational statements" on observation record. (Harris. Morrison. Serwer, & Gold, 1968) 

NO "Teacher Control, " "A low degree of pupil disruptive acts during observation." (Hunter. 1969) 

NO "Threatens, warns" on observation, checklist. (Morsh, 1956) 

YES , . rejection or correction of student's response . . ." and 'criticizes or justifies authority . . ." on observation record, (Perkins, 1965) 

YES "Steady-state criticism" and "Pupil initiation following teacher criticism" on obseivation record. (Soar, 1966) 

YES "Manifest teacher disapproval" and "Emotional tone or mood of teacher" on observation record. (Spaulding, 1965) 

YES "Disparaging vs. non-disparaging" on rating scale and Q-sort; (Wallen. 1966) 

NO "Affirmative and Negative" comments on observation record. (Wright & Nuthall, 1970) 



USE OF STRUCTURING COMMENTS 



YES "Beginning the Lesson" and "Ending the Lesson" on the. Stanford Teacher Competence Appraisal Guide (a seven-point rating scale), (Belgard, 
Rosenshine, & Gage, 1968) 

YES "Beginning the Lesson" and "Ending the Lesson" on the Stanford Teacher Competence Appraisal Guide (a seven-point rating scale). (Fortune. 1967), 
YES "Teacher structure," "Deviation of individual teacher's amount of lines devoted to structuring." (Furst. 1967) 
YES- Number of "Verbal markers of importance" ir. Wesson transcript. (Pinney, 1969) 
No variables resembling "structuring" could be located. (Soar, 1966) 

"Structuring (total)," "teacher information following question," and "terminal structuring." (Wright & Nuthall, 1970) 



TABLE 1. (Cont,) 
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VALIDITY TYPES OF QUBSTIQNS 

NO "Communication," "Management;" and "Encouragement" were observed in classroom, (Conners & Elsenberg, 1966) 

YES Questions requiriny pupil understanding of the meaning vs. those, requiring only recognition of the symbol involved, (Harris. Morrison. Serwer. 
«4 Geld. 1968) 

YES Seven question categories: neutrai, rhetorical, factual, clarifying, associative, critical thinking, and values. (Kleinman. 1964) 
YtS "Asks questions about content . . ." vs. "Asks questions that stimulate thinking . . ." (Perkins. 1964) 

YES Questions were categorized as "organizing," "hypothetical." "opinion." "factual." "interpretation." or "personal reference," (Solomon, Bezdek. 
& Rosenberg. 1963} 

YES "Technique - eliciting a verbal response in an open-ended fashion." and "Technique — eliciting <i specific or solution that the teacher has in 

mind" on a rating scale, (Spaulding. 1965} 
NO "Convergence-divergence" on an observation record, {Thompson & Bov;ers. 1963) 
YES "Closed questions" and "open questions" on an observation record. (Wright & Nuthali. *970) 



^ PROBING 

YES "Teacher encourages interpretation, generalization, solution" on an observation record. (Soar. 1966) 

YES "Technique — eliciting cfarification in a non-throgtening way" on an observation record, (Spaulding. 1965} 

YES "Redirects question." "reciprocates to lift." and "reciprocates to extend" on an observation record. (Wright & Nuthali, 1970) 



^ LEVEL OF DIFFICULTY OF iNSTRUGTION 

YES • Students rated teachers in hypothetical situations as using "moct strict standard," "most liberal standard." or "somewhere in between." 
(Nikoloff. 1965) 

NO The variable referred to by Rosenshine & Furst could not do located. {Torr3n£:c & Parent. 1965) 
YES "Difficulty" on L earnin:; Environment Invonron/ . (Walberq. 1969) 



of clarity : 

1. 'Clarity of Presentation ' (Belgard, Rosenshine^ and Gage^ I968; 
Fortune, 196?; Fortune , Gage, and Shutes, I966)...." (p. kk) . If 
this was the case, we simply quote this definition from the text of 
the study and note it in the table.. 

In many other cases,. Rosenshine and Furst merely ■ describe the 
author's operational definition. For example, on enthusiasm ; "Teacher 
enthusiasm has been assessed by .observer estimation of the amount 
of vigor and power exhibited by the teacher during classroom presenta- 
tion. ..." (p. ^6) . In these cases, we attempted t-:^ locate the teaching 
variables in each study which most closely eorresponded to Rosenshine 
and Furst 's description. 

In a few cases, we were unable to locate the teaching variable 
described by Rosenshine and Furst. These cases have been noted in the 
table. 

In deciding whether or not the author claimed significant results, 
we concentrated on the tables and discus.sion of results in each study. 
If the results table(s) specifically indicated significance levels 
for the variable in question, we regarded this as a claim of significance. 
If indications of significance were absent from the tables, but were 
present in the -discussion of results, we regarded this as a claim of 
significance as well. In several studies, an explicit claim of 
significance was absent from both tables and discussion of results, 
but the discussion conveyed the impression that the author was claiming 
significance. Thus, we had three criteri^i for concluding that the 
author claimed significance: (l) .whether or not he reported signifi- 
cance levels in tables of results, (2) whether or not he claimed signifi- 
cance in the text of his report, or (3) whether or not the significance 



of results was implicit in his discussion. If a study met any one of 
• these crit / ; we concluded that the author was claiming signif icance# 
Simil:.':- teria . were used in the cases where we concluded that 
the author did not claim significant results. , V/e reached this conclu- 
sion when the relevant values in the results tables did not reach 
significance levels specified in the tables, when the author reported" 
non-significant results in his discussion, or when the non-significance 
of results was implicit in his discussion. 

Clearly, there is a difference between the statement that the 
author claimed significant results and- the statement that the author 
obtained significant results* The fact that we were concerned with 
the former staLement means that we were more -stringent in our judgments 
concerning the significance of results than were Rosenshine and P'urst. 
In the cases where Rosenshine and Furst report that the author found 
significant results, but we concluded that the author did not claim 
significance, Rosenshine and Furst looked up the significance of the 
reported values themselves. In view of other design considerations, 
we believe that this is an unwarranted procedure. Most of the studi'3s, 
for instance, failed to report the tenability of critical assumptions. 
If these assumptions were not met , the reported values in these studies 
could be spuriously high or low-. The original authors are likely to 
have very good reasons for not claiming significance. These reasons 
are frequently not made explicit in the reports. 

• The criteria used for analyzing the design and methodology of 
the studies are explained in detail in the Teclmical Appendix of our 
larger report, "An Analysis of Selected Studie.s of Teacher Effectiveness. 
Each study was analyzed for such research desiajn. features as random 
assignment, number of teachers and number of students per teacher, 

ErIc 
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adjustment of post-test scores for initial differences , experimental 
independence of variables, and significance or non-significance 
claims. The studies were also "analyzed for the- legitimacy of their 
stati stical procedures : assumptions of linear relationships , parallel 
regression slopes, normality, homogeneity of variance, method of 
estimating the slope used in adjusting post-test scores, and the 
effect of extreme data points ("outliers"). 

Results 

The results of the analysis are summarized in Table 1 and Table 2« 
Table 1 lists each author's operational definition of the teaching 
variables cited by Rosenshine and Furst. Our judgment concerning the . 
appropriateness of the operational definition to the variable cited 
is indicated. In our judgment, 26 of the 84 operational definitions 
of teacher behavior do not correspond to the variable cited. 

Table 2 lists the studies, the variables for which they are citedi- 
in Rosenshine and Furst 's review, and a number of their basic charac- 
teristics. 

Fifty-eight citations used the teacher as the unit of sampling 
(79';^) and 10 (13?^) used the pupil. In the other 10 cases (13%) i 
the information reported was not sufficient to determine the. unit of 
sampling. 

One citation reported a linear relation (1^) and 6l did not 
report- having tested the assumption of linearity (78%). Linearity is 
not relevant to the remaining I6 citations (21/^). 

Five citations report randomly assigned .subjects to treatments 
(6?o) and 65 did not randomize (85?^). Eight did not report c.^ough 
information for a judgment (11%). 

O 
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TABLE 2. DESIGN AND METHODOLOGY CHARACTERISTICS 
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Eleven studies used a pooled within-group estimate of regression 
slope in adjusting for initial ability (l^fO, while ^1 did not (53^)* 
In five cases, there was insufficient information reported to make 
a jud£raent (6?0. The remaining ?1 citations either did not adjust 
for initial ability or did not report statistical tests for the variable 
in question {?!%). 

One citation reported that there was no treatraent-slope interaction 
(r;^), 56 did not report having tested the assumption of parallel 
slopes for all treatment groups (7?%), and ?1 either did not adjust 
for initial ability or did not report ^:<tatistical tests for th€ 
variable in question • 

Fifty-eight citations (7^%) did not report the disposition of 
outliers (extreme data points). This characteristic does not apply 
to the remaining 20 citations (265o). 

Sixty citations, included at least 15 classes (77^)j and eight 
reported less than 1,5 classes (10^). Ten citations did not report 
sufficient information for a judgment to be made • 

Sixty-one citations included at least 15 pupil;3 per class (78?^), 
and five included less than 15 per class {6%). In two cases, there 
was insufficient information ( , and in the remaining 10 cases, no 
statistical tests were performed for the variable in question {\J)%) * 

Seventy citations did not report whether the assumption of a 
normal distribution had been tested (90%). The remaining eight did 
not report performing statistical tests for the variables in question 
(10%). 

The covariate was found to be independent of teaching behavior 
in 15 studies (19%) • It was not ihdependent of teaching behavior in 
?2 cases (28%). There was insufficient information for judgment in 



25 citations (32^.)« The category does not apply to the remaining 
16 cases (21%). 

Testing for homogeneity of variance was not reported in 68 studies 
(8?^)« The assumption does not apply to the remaining 10 cases (13%) • 

Thirty-three of the reports claimed a significant relation between 
the variable in question and some criterion measure {h2%) * The other 
^5 citations did not claim significant results (58^^)* 

Discussion 

These results lead us to conclude that an empirical basis for 
performance-based teacher education does not exist* The conception, 
design, and methodology typical of these studies pr . : ides their use 
as such a basis. 

The operational definitions of teaching shown in Table 1 do not 
correspond, in many instances, to the names of the teaching variables 
cited* For example, a rating of "difficulty of the lesson" (in Ghall 
and Feldmann, I966) is cited as the basis for a Btudy of "clarity.*' 
.Nearly a third of the operational definitions studied seem similarly 
inappropriate • 

Further, operational definitions of teacher behavior that have 
little in common are of ten combined as examples of a single teaching 
variable. Note that a rating of "organization of the lesson" 
(Belgard, Rosenshine, and Gage , I968) ' is combined v/ith "disorganized" 
as a characteristic of the classroom climate. .(V/alberg and Anderson, 
1968) under the single variable "organization." 

Beyond these logical difficulties, the operational definitions 
reported in the studies generally reflect a picture of -the teacher in 
a traditional lecture-^discussion role and are usually defined so - 
vaguely as to be of little use in training teachers.'* "Clarity of 
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presentation*' on a seven-point rating scale (Fortune, Gage, and 
Shutes, 1966) assumes that teaching requires a "presentation.'' A 
.rating on "clarity," with no explication of what behavior "clarity" 
involves, does not seem useful. Continued research on such sterile 
definitions of teaching seems unlikely to provide a basis for training 
teachers • 

The operational definitions of student achievement are similarly 
shallow. For example, the criterion of student achievement used in 
several studies is a ten-question multiple-choice test of information 
based on Atlantic magazine articles on economic, political, and social 
conditions in Yugoslavia published between November 196^ and August 1965« 

The research design of raost of these studies is fundamentally 

weak. Only five of the 78 citations report using random assignment 

of pupils to treatments , yet- they employ statistical analyses whose 

interpretation implies randomisation. As Lord (I963) has said: 

If the individuals are not assigned to the treatments at 
random, then it is not helpful to demonstrate statisti- 
cally that the groups after treatment show more difference 
than would be expected by random assignment.... 

In most (^5 of 78) of the studies cited, the author did not " 
claim to. find a significant relation between the teacher-performance 
variable and student achievement . Further, in many of these studies 
negative or clearly non -significant relations are reported. These 
negative results are ignored. 

Though most studies employed adjusted post-test means as criteria, 
only about 10% used the preferred method of adjustment. 

Aside from questions of research design, the statistical analyses 
reported make interpretations of the- studies extremely undependable. 
Critically important assumptions such as linearity, normality, homo- 
geneity of variance, and parallel slopes are almost universally 
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unreported (see Table ?.).• There is reason to believe that in many 
instances these assumptions are not , in fact^ tenable. With no 
information, one might choose to believe that a particular basic 
assumption was met, or could somehow be disregarded in a particular 
study. The compound effect of unreported and unmet assumptions is 
another matter. For example, homogeneity of variance and normality 
are not independent assumptions; if one is not met, the other becomes 
even more critical. One cannot, with scientific integrity, dismiss 
s.uch assumptions and interpret the results at will. 

One of the pitfalls commonly encountered in this literature is. 
the item-factor fallacy in factor ana.lysis. Posenshine and Furst 
report significant results in a small number of studies employing 
factor analysis.- When a relevant item of teacher-behavior loaded 
on a factor that was significantly correlated to achievement^ they 
claim a significant relationship between the item and achievement. 
A significant correlation between a factor and student achievement, 
however, does not necessarily imply significant correlations between 
achievement and every teacher-behavior item loading on that factor* 

The performance-based-teacher-education model does not treat 
two important types of variables. It ignores what is to be taught* j; 
Though the studies reviewed here v/ere concerned with everything .from 
rj»ircraft mechanics to reading, no effort is apparent in identifying 
the possible interactions between teacher-behavior variables end 
content. Is there one set of best teaching behaviors for teaching 
everything ? If there is an important interaction between content 
and teaching behavior ( given cognitive achievement as criteria), 
then the conclusions about what behavior is effective may be determined 
as much by content as by teacher behavior. 
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The model ignores who is to be taught- Despite persistent 
evidence that variables such as socio-economic status and race, are 
more important determinants of achievement level than teacher behavior,, 
the research on teacher-behavior variables largely ignores such differ- 
ences among students. Similarly , the studies cited by Rosenshine and 
Furst cover a- wide student age range (pre-school to adult), yet the 
idea that effective teacher behavior might be different for different 
age groups is ignored when conclusions are drawn from such collections. 
Is there one set of best teacher behaviors for teaching everything 
to everybody ? 

Other Reviews 

The studies cited by Rosenshine and Furst were selected as the . ' 
best and most conclusive available. Since our analysis and interpre- 
tation of these studies leads us to doubt the case for demonstrated 
relations between teacher-behavior variables and student achievement , 
it seems prudent to look at the conclusions that have been stated 
by others who have reviewed the research on this subject. 

In 1958, Brim (1958, p- 3^) concluded, after reviewing the status 

of the research: 

However, although there is a vast body of research 
on the relation of teacher characteristics to effective- 
ness in teaching, the reviews of this research. ••show 
no consistent relation between any characteristics , 
including intelligence, and such teaching effectiveness. 

In 1967, Stephens (I967, p. 83-8^ ) concluded : 

It is always possible that the vast number of inves- 
tigations have "just happened'' to hit on the more intransient 
areas of growth. It is also possible that they have just 
happened to miss the areas which would have shown the effects 
of the device being tested. But these possibilities are 
not very comforting. The argument implies that some 
systematic force .has been at work to keep our sample a 
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biased sample. ' The argument rests on the assumption that 
the unmeasured areas of character growth are more responsive 
to the devices being tested than the more measurable areas 
of academic achievement • Actually, that assumption is 
very dubious, a point which will be developed at some . 
length in Chapter 9» 

The other four arguments are all confronted by one 
very, serious obstacle^ The fact is, insensitive as the 
tests may be and over-controlled or under-controlled as 
some experiments probably are and exacting as standards 
undoubtedly are, a great deal of growth does appear and 
does meet the standards. The investigations cited do not 
fail to reveal growth. They merely fail to reveal differ- 
ences in growth attributable to the administrative (teaching) 
variables. If we use other variables , such as background 
factors , moreover , marked differences in growth also come 
through. If the tests, and the designs^ and the criteria 
.of significance permit such differences to appear, it 

is difficult, to see why they should not also permit differ-- 
ences in administrative (teaching) factors to come through 
if these were presents 

In 1968, Dubin and Taveggia (I968, p. 35) concluded, after reviewing 

91 comparative studies of college teaching technologies conducted 

between 19^^ and I965: 

These data demonstrate clearly and unequivocally that 
there is no measurable difference among truly distinctive 
methods of college instruction when evaluated by student 
performance on final examinations. 

Mood (1970, 7) concluded: 

We can only make the not very useful observation that 
a t the present moment we cannot make any sort of meaning- 
ful quantitative estimate of the effect of teachers on 
student achi evement. 

Gage (1972) reports a series of reviews that come to similar 
conclusions:; Committee on the Criteria of Teacher Effectiveness 
(1953), Wallen and Travers (1963) Getzels and Jackson (1963), 
and Withall and Lewis (I963). 



Conclusions 

Our analysis of this literature leads us to three conclusions: 
First, the research literature on the relation between teacher 
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performance and student achievement doe^ not offer an empirical basis 
for the prescription of teacher-training objectives. 

Second^ this literature fails to provide such a basis, not because 
of minor flaws in the statistical • analyses, but because of sterile 
operational definitions of both teaching and achievement , and because 
of fundamentally weak research designs. 

Last, given the well-documented strong association between student 
achievement and variables sucli as socio-economic status and race (see 
Coleman and others, I966), the effects of techniques of teaching on 
achievement (as these variables are conventionally defined) are 
likely to be inherently trivial* 



We appreciate 
Flanders and Barak 



the generous support and assistance 
Rosenshine in this investigation. 



of Ned A. 



17 



REFERENCES 



American Educational Research Association, Committee on the 
Criteria of Teacher Effectiveness, 1952. Report of 
the*. .Review of Educational Research, 22: ?38-263. 

Anderson, G. J., and H. J. Walberg, I968. "Classroom Climate 

and Group Learning.'' International Journal of Educational 
.Sciences, 2:175^80. . ' 

Anthony, B, C. M. , 196?. "The Identification and Measurement 
of Classroom Environmental Process Variables Related to 
Academic Achievement." Unpublished doctoral dissertation. 
University of Chicago*. 

. Beiderman, D» D. , 196^. "Relationship between Teaching Style 
and Pupil Behavior." Unpublished doctoral dissertation. 
University of California at Los Angeles (University Mic- 
rofilms Noo 6^1^12, 180). 

Belgard, M., B. Rosenshine , and N.'L- Gage, I968. "The Teach-- 
er's Effectiveness in Explaining: Evidence on Its Gener- 
, ality aiid Correlation with Pupil's Ratings and Attention 

' Scores," in N. L. Gage et al. , Explorations of the Teach- 

er's Effectiveness in Explaining. Stanford: Stanford 
Center for Research and Development in Teaching, . Stanford 
University (Technical Report No.Mf)* 

Beilack^ A* A., H. Kleibard, R. T. Hyman, and F. L. Smith, 

1966, The Language of the Classroom . New York: Columbia 
University Press* . 

Brim, 0. G. , 1958. S^oc iology and the Field of Education . 
New York: Russell Sage Foundation. 

Carline, J. L. ,1969. "An Investigation of the Relationship 
between Various Verbal Strategies of Teaching Behavior 
and Achievement of Elementary School Children." Unpub- 
lished doctoral dissertation, Syracuse University. 

Chall, J. . S. V ^^nd S. C, Feldmann, I966. A Study in Depth'of 

First G rade Reading . New York: The City College of the 
City University of New York (U. S. Office of Education 
Cooperative Research Project No. 2728). 

Coleman, J»S», and otherSt 1966, Equality of Educational 
Opportunity ^ Washington, D. C. : U. S. Government 
Printing Office- 



18 



Conners, C» K. , and L» Eisenberg, 1966, The Effect of Teacher 
Behavior on Verbal Intelligence in Operation He a dstart 
Children > Baltimore: Johns Hopkins University School 
of Medicine (U. S» Office of Economic Opportunity Head- 
star t Contract No* 510). 

Cook, S. E. , 1967. "The Effect of Teacher Methodology upon 
Certain Achievements of Students in Secondary School 
Biology.^' Unpublished doctoral dissertation, University 
of Iowa, Iowa City. . 

Dubin, R. , and T. C. Taveggia, 19680 The Teacher-Learning 
Paradox; A Comparative Analysis of College Teaching 
Methods. Eugene, Ore.: Center for the Ad.vanced Study 
of Educational Administration, University of Oregon. 

Elashoff, J. D. , 1969. "Analysis of Covariance: A Delicate 
Instrument." American .Educational Research Journal, 
6:383-^01. 

Flanders, N. A., 1970. Analyzing Teaching Behavior . 
New York: Addison-Wesley Co. 

Fortune, J. C. , I966. The Generality of Presenting Behaviors 
in Teaching Pre-School Children . Memphis, Tenn. : 
Memphis State University. 

« 1967» A Study of the Generality of Presenting Behav -. 

iors in Teaching . Memphis, Tenn.: Memphis State Uni- . 
versity (U. S. Office of Education Project No. 6-8^68). 

L. Gage, and R. E. Shutes, I966. The Generality 
of the Ability to Explain . Paper presented to the 
American Educational Research Association, February 1966. 
Amherst: University of Massachusetts , College of Edu- 
cation. 

Furst, N. F. , 1967.. The Multiple Languages of the Classroom . 

Paper presented at the meeting of the American Educational 
Research Association, February 1967* Also available 
as an unpublished doctoral dissertation. Temple Univer- , 
. sity, Philadelphia, 1967. 

Gage, N. L. , 1972. Teacher Effectiveness and Teacher Education . 
Palo Alto , Calif. : Pacific Books. ~ ~" 

Getzels, J. W. , and P, W. Jackson, 1963* "The Teacher's Person-' 

ality and Chax^acteristics," in N. L. Gage (Ed.), 

Handbook of Research on Teaching . Chicago: Rand McNally. 

Pp. 505-82. ~ ■ . 



ERIC 



19 



Gunnison, J. P*? I968. "An Experiment to Determine the Effects 
of Changing Teacher Glassroora Behavior through Training 
of Student-Teachers in the Use of the Flanders' Interaction 
Analysis System." Unpublished doctoral dissertation, 
Arizona Stc?.te University, Terape, Arizona, 

Harris, A. J., et al., 19o8. A Continuation of the CRAFT Pro - 
ject: Comparing Reading Approaches with Disadvantaged 
Urban N egro Ghaldren in Primary Grades , U. S. Office 
of Education Project No, 5-.0570-P-12-1. New York: 
Division of Teacher Educa tion of the City University 
■ of Mew York. 

Herman, V/, L. , Jr,. , et al* , 1969* "The Relationship of Teacher- 
Centered Activities and Pupil-Centered Activities to Pu- 
pil Achievement and Interest in I8 First-Grade Social 
Studies Classes . " American Educational Research Journal, 

Hiller, J. H* , G. A/ Fisher, and Kaess, 1969. ''A Computer 
Investigation of Verbal Characteristics of Effective 
Classroom Learning. " American .Educational Research 
■ Journal, 6: 661-77, 

Hunter, C. P. ^ I969. "Classroom Climate and Pupil Characteris-- 

tics in Special Classes for the Educationally Handicapped*" 
Unpublished doctoral dissertation, University of Southern 
California, Los Angeles. 

Kleinraan, a*, 196^f. "General Science Teacher's Questions, 

Pupil and Teacher Behaviors, and Pupils' Understanding 
of Science." Unpublished doctoral dissertation. Univer- 
sity of Virginia, Chariot tesville , Virginia (University 
Microfilms No. 65~396l). 

LaShier, V/. S., Jr., I967. "The Use of Interaction Analysis 

in BSCS Laboratory Block Classrooms. " Journal of Teacher 
Education, 18: '+59-^6 

^Mastin, V.r £• , 1963» "Teacher Enthusiasm/" Journal of Educa- 
tional Research, 56: 385-86, 

Medley, D. M. , and H, E- Mitzel, 19?9. "Some Behavioral Cor- 
relates cf Teacher Effectiveness." Journal of Educational 
Psychology, 50:239-^6. 

Miller, G. L. , I96S. "Collaborative Teaching and Pupil Think 
ing." Journal of Teacher Education, 17:337-58. 



20 



Mood, A., 1970. ''Do Teachers Make a Difference?" in 
Do Teachers Make a Difference? A Report on Recent Re - 
search on Pupil Achievement , Washington, D. C. : Bureau 
of Educational Personnel Development, Office of Educa- 
tion. Pp. l-2h. 

Morsh, J. E. , 1956, Systematic Observation of Instructor 
Behavior . Development Report No, AFPTRC-TN-56-5?. 
San Antonio: Air Force Personnel and Training Research 
Center, Lackland Air Force Base. 

Nikoloff, S. E. B. , I965. "The Relationship of Teacher Stan- 
dards to the Written Expression of Fifth and Sixth Grade 
Children." Unpublished doctoral dissertation. State 
University of New York at Buffalo, Buffalo, New York 
(University Microfilms No. 66-35?0). 

Perkins, H. V., 1964. "A Procedure for Assessing the Class- 
room Behavior of Students and Teachers." American Edu- 
cational Research Journal, 1:2^0-60. 

, 1965» "Classroom Behavior and Under-Achieveraent. " 

American Educational Research Journal, 2:1-12* 

Pinney, R. H. , I969. "Presentational Behaviors Related to 

Success in- Teaching. " Unpublished doctoral dissertation 
Stanford University. 

Rosenshine, B. , I968. "Objectively Measured Behavioral Pre- 
dictors of Effectiveness in Explaining." Stanford, 
Calif.: Technical Report No. 4, Stanford Research and 
Development Center in Teaching. 

^ and N. F« Furst, 1971 • "Research on Teacher Perfor- 
mance Criteria." In B. 0. Smith, (Ed.), Research in 
Teacher Education; A Symposiujn . Englewood Cliffs, 
N. J.: Prentice-Hall.- Pp. 37-72o 

Shutes, R. E. , I969. "Verbal Behaviors and Instructional 
Effectiveness." Unpublished doctoral dissertation, 
Stanford University. 

M. , 1967. The Process of Schooling . New York: 
Rinehart, and Winston. 

Soar, S. , 1966. An Integrative Approach to Classroom Learn 
ing , Philadelphia, Pa. : Temple University (Final Re- 
port, Public Health Service Grant No. 5-Rll-MH OIO96 
and National Institute of Mental Health Grant No. 7-RII- 
MH 02045). (ERIC ED: 033-^79)- 



Solomon, D, , W. E. Bezdek, and L, Rosenberg, 1963* Teaching 
Styles and Learning * Chicago: Center for the Study of. 
Liberal Education for Adults. (ERIC ED: 026-556). 

Spaulding, R. L. , 1965» Achievement, Creativity, and Self- 
Concept Correlates of Teacher-Pupil Transactions in 
Elementary Schools . Hempstead, New York: Hofstra 
University (U. S. Office of Education Cooperative Re- 
search Pro ject No» 1352)* 

Thompson, G. R. , and C. Bowers, 1968. Fourth Grade Achieve - 
ment as Related to Creativity, Intellij^ence , and Teaching 
Style . Paper presented to the American Educational 
Research Association^ Chicago, February I968. 

Torrance , E. P. , and E. Parent, I966. Characteristics of 
Mathematics Teachers that Affect Students ' Learning s. 
U. S. Office of Edvtcation Cooperative Research Project 
No, 1020, Contract No, SAE-8993. Minneapolis: Minne- 
sota School Mathematics and Science Center, Institute 
of Technology, University of Minnesota. 

Vorreyer, D, F. , 1965« "An Analysis cf Teacher Classroom Beha- 
vior and Role./' Unpublished doctoral dissertation. 
University of Maryland, College Park.. 

V/alberg, H. J., I969- "Predicting Class Learning." American 
Educational Research Journal, 6:529-^3* 

, and G. J. Anderson, I968- "Classroom Climate and 

Individual Learning." Journal of . Educational Psychology, 
59:^1^^-19- 

V/allen, N. E. , I966. Re lationships between Teacher Character - 
istics and Student Behavior: Part Three . U. S; Office 
of Education'^ Cooperative Research Project No. SAEOE 5- 
10-181. Salt Lake City: University of Utah. 

^ and R. M. W. Travers, 1963. "Analysis and Investi- 
gation of Teaching Methods," in N. L. Gage (Ed.), 
Handbook of Research on Teachin g. Chicago: Rand McNally. 
Pp. ^58-505. 

Withall, J., and W.. W. Lewis, I963. "Social Interaction in 

the Classroom, " in N. L. Gage (Sdo) Handbook of Research 
on Teaching . Chicago: Rand NcNally'li Pp. 683-71^« 

Wittrock, M., I962. "Set Applied to Student Teaching." Journal 
of Educational Psychology, 53:175-80* 



22 



Wright, C. J., and G. Nuthall, 1970. "The Relationships be- 
tween Teacher Behavior and Pupil -Achievement in Three 
Experimental Elementary Science Lessons." American 
Educational Research Journal, 7:^77-91. 



